Ch.13 Determinants

Properties

Recall an $n\times n$ matrix $T$ is nonsingular iff it is invertible, i.e. iff the homogeneous system $T\vec{x}=\vec{0}$ has only the trivial solution.

We work toward a formula that determines whether a matrix is nonsingular.

For a $1\times1$ matrix

$(a)$ is trivially nonsingular iff $a\ne0$

For a $2\times2$ matrix

$\begin{pmatrix}a&b\\c&d\end{pmatrix}$ is nonsingular iff $ad-bc\ne0$

For a $3\times3$ matrix

$\begin{pmatrix}a&b&c\\d&e&f\\g&h&i\end{pmatrix}$ is nonsingular iff $aei+bfg+cdh-hfa-idb-gec\ne0$

We gather the family of formulas $a$, $ad-bc$, etc., one for each $n\times n$ size, under a single name.
The determinant function $\text{det}_{n\times n}:\mathcal{M}_{n\times n}\to\mathbb{R}$ is defined for each $n$ such that an $n\times n$ matrix $T$ is nonsingular iff $\text{det}_{n\times n}(T)\ne0$

Based on the first three determinants, we extrapolate conditions that the determinant function must satisfy:
if $T$ has rows $\vec{\rho}_1,...,\vec{\rho}_n$

  1. $\text{det}(\vec{\rho}_1,...,k\vec{\rho}_i+\vec{\rho}_j,...,\vec{\rho}_n)=\text{det}(\vec{\rho}_1,...,\vec{\rho}_j,...,\vec{\rho}_n)$ for $i\ne j$
    (row combination operations don't change the determinant)
  2. $\text{det}(\vec{\rho}_1,...,\vec{\rho}_j,...,\vec{\rho}_i,...,\vec{\rho}_n)=-\text{det}(\vec{\rho}_1,...,\vec{\rho}_i,...,\vec{\rho}_j,...,\vec{\rho}_n)$ for $i\ne j$
    (swapping two rows negates the determinant)
  3. $\text{det}(\vec{\rho}_1,...,k\vec{\rho}_i,...,\vec{\rho}_n)=k\cdot\text{det}(\vec{\rho}_1,...,\vec{\rho}_i,...,\vec{\rho}_n)$ for any scalar $k$ (including $k=0$)
    (multiplying a row by $k$ multiplies the determinant by $k$)
  4. $\text{det}(I)=1$ for the identity matrix $I$

We often write $|T|$ instead of $\text{det}(T)$

Small Note

(2) is redundant because
$$T\xrightarrow{\rho_i+\rho_j}\space\xrightarrow{-\rho_j+\rho_i}\space\xrightarrow{\rho_i+\rho_j}\space\xrightarrow{-\rho_i}\hat{T}$$
This swaps rows $i$ and $j$: the first three operations don't change the determinant, while the last negates it.

From above, we can derive these lemmas:

  1. A matrix with two identical rows has determinant $0$.
  2. A matrix with a zero row has determinant $0$.
  3. A matrix is nonsingular iff its determinant is nonzero.
  4. The determinant of an echelon form matrix is the product down its diagonal.

Proof

For the first, swap the two identical rows. By condition (2), the determinant is negated but the matrix is unchanged, so the determinant equals its own negative and must be $0$.

For the second, multiply the zero row by $2$. By condition (3), the determinant is doubled, but the matrix remains the same, so the determinant is unchanged. A number equal to its own double must be $0$.

The third is by definition.

The fourth has two cases: if the echelon form matrix is singular, then it has a zero row, and so contains a $0$ on the diagonal. The determinant is $0$ by the second lemma, and the product down the diagonal is also $0$, so they agree.
If the echelon form matrix is nonsingular then none of the diagonal entries are $0$. We can then use condition (3) to get ones on the diagonal:
$$\left|\begin{matrix}t_{1,1}&t_{1,2}&\cdots&t_{1,n}\\0&t_{2,2}&\cdots&t_{2,n}\\\vdots&&\ddots&\vdots\\0&0&\cdots&t_{n,n}\end{matrix}\right|=t_{1,1}\cdot t_{2,2}\cdots t_{n,n}\cdot\left|\begin{matrix}1&t_{1,2}/t_{1,1}&\cdots&t_{1,n}/t_{1,1}\\0&1&\cdots&t_{2,n}/t_{2,2}\\\vdots&&\ddots&\vdots\\0&0&\cdots&1\end{matrix}\right|$$
Then clearing out the entries above the diagonal uses condition (1), so
$$=t_{1,1}\cdot t_{2,2}\cdots t_{n,n}\cdot|I|=t_{1,1}\cdot t_{2,2}\cdots t_{n,n}$$
So, the determinant is the product down the diagonal in this case as well.

With these rules, we can find the determinant using Gauss's Method

Example 13.1

Using Gauss's Method, find the determinant of the matrix
$$\begin{pmatrix}1&3&-2\\2&0&4\\3&-1&5\end{pmatrix}$$


$$\begin{vmatrix}1&3&-2\\2&0&4\\3&-1&5\end{vmatrix}=\begin{vmatrix}1&3&-2\\0&-6&8\\0&-10&11\end{vmatrix}=\begin{vmatrix}1&3&-2\\0&-6&8\\0&0&-7/3\end{vmatrix}=1\cdot(-6)\cdot(-7/3)=14$$

Example 13.2

Find the determinant of
$$\begin{pmatrix}0&3&1\\1&2&0\\1&5&2\end{pmatrix}$$


This example also illustrates condition (2): the upper-left entry is $0$, so we begin with a row swap, which negates the determinant.
$$\begin{vmatrix}0&3&1\\1&2&0\\1&5&2\end{vmatrix}=-\begin{vmatrix}1&2&0\\0&3&1\\1&5&2\end{vmatrix}=-\begin{vmatrix}1&2&0\\0&3&1\\0&3&2\end{vmatrix}=-\begin{vmatrix}1&2&0\\0&3&1\\0&0&1\end{vmatrix}=-1\cdot3\cdot1=-3$$

For a $2\times2$ determinant, notice how the terms $ad$ and $bc$ are products along the diagonals of the matrix.
For a $3\times3$ matrix a similar mnemonic works: copy the first two columns to the right of the matrix, add the products along the three down-diagonals, and subtract the products along the three up-diagonals.

For larger matrices, use Gauss's Method
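
As a concrete sketch of this procedure (the function name `det_gauss` and the use of exact `fractions.Fraction` arithmetic are my own choices, not from the text):

```python
from fractions import Fraction

def det_gauss(matrix):
    """Determinant via Gauss's Method: row combinations leave the
    determinant unchanged, each swap negates it, and the determinant
    of the resulting echelon form is the product down the diagonal."""
    t = [[Fraction(x) for x in row] for row in matrix]  # exact arithmetic
    n = len(t)
    sign = 1
    for col in range(n):
        # Find a row at or below the diagonal with a nonzero pivot.
        pivot = next((r for r in range(col, n) if t[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)        # singular: no pivot in this column
        if pivot != col:
            t[col], t[pivot] = t[pivot], t[col]
            sign = -sign              # condition (2): a swap negates
        for r in range(col + 1, n):
            k = t[r][col] / t[col][col]
            for c in range(col, n):   # condition (1): -k*rho_col + rho_r
                t[r][c] -= k * t[col][c]
    result = Fraction(sign)
    for i in range(n):
        result *= t[i][i]             # product down the diagonal
    return result

print(det_gauss([[1, 3, -2], [2, 0, 4], [3, -1, 5]]))  # 14, Example 13.1
print(det_gauss([[0, 3, 1], [1, 2, 0], [1, 5, 2]]))    # -3, Example 13.2
```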

An upper triangular matrix is a square matrix with only $0$s below the diagonal.
$$\begin{pmatrix}t_{1,1}&t_{1,2}&\cdots&t_{1,n}\\0&t_{2,2}&\cdots&t_{2,n}\\\vdots&\vdots&\ddots&\vdots\\0&0&\cdots&t_{n,n}\end{pmatrix}$$

The determinant of an upper triangular matrix $T$ is the product down the diagonal.

Proof

If the diagonal entries are all nonzero, the matrix is in echelon form, so the determinant is the product down the diagonal.
If $t_{i,i}=0$ for some $i\in\{1,...,n\}$, we must prove that $|T|=0$. This holds iff $T$ is singular, so it suffices to show that the columns of $T$ are linearly dependent. Consider the matrix formed by the first $i$ columns. It has the form
$$\begin{pmatrix}t_{1,1}&t_{1,2}&\cdots&t_{1,i-1}&t_{1,i}\\0&t_{2,2}&\cdots&t_{2,i-1}&t_{2,i}\\\vdots&\vdots&\ddots&\vdots&\vdots\\0&0&\cdots&t_{i-1,i-1}&t_{i-1,i}\\0&0&\cdots&0&0\\\vdots&\vdots&&\vdots&\vdots\\0&0&\cdots&0&0\end{pmatrix}$$
Only the first $i-1$ rows can be nonzero, so these $i$ columns lie in an $(i-1)$-dimensional space and must be linearly dependent. But they are a subset of the columns of $T$, so the columns of $T$ are linearly dependent.
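
For instance (the numbers here are mine), the theorem gives immediately
$$\begin{vmatrix}2&5&1\\0&3&4\\0&0&6\end{vmatrix}=2\cdot3\cdot6=36$$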


Permutation Expansion

Existence and Uniqueness

The problem with using conditions to define a function is that we must verify that there is one and only one function that satisfies those conditions.
We do that by determining a well-defined formula.

First, we prove uniqueness:
For each $n$, if there is an $n\times n$ determinant function then it is unique.

Proof

Suppose there existed two functions $\text{det}_1,\text{det}_2:\mathcal{M}_{n\times n}\to\mathbb{R}$ satisfying the four conditions. Given a square matrix $M$, fix some way of reducing it to echelon form, keeping track of sign changes and scalar factors, and multiply down the diagonal at the end; both functions must return this same result. Since they return the same output for every input, they are the same function.

Let $V$ be a vector space. A map $f:V^n\to\mathbb{R}$ is multilinear if

  1. $f(\vec{\rho}_1,...,\vec{v}+\vec{w},...,\vec{\rho}_n)=f(\vec{\rho}_1,...,\vec{v},...,\vec{\rho}_n)+f(\vec{\rho}_1,...,\vec{w},...,\vec{\rho}_n)$
    (the function splits addition one input at a time)
  2. $f(\vec{\rho}_1,...,k\vec{v},...,\vec{\rho}_n)=k\cdot f(\vec{\rho}_1,...,\vec{v},...,\vec{\rho}_n)$
    (the function splits scalar multiples one input at a time)

Determinants are multilinear

Proof

The second of the two properties is simply condition (3) from above.
For the first property, there are two cases.
If the set of the other rows $\{\vec{\rho}_1,...,\vec{\rho}_{i-1},\vec{\rho}_{i+1},...,\vec{\rho}_n\}$ is linearly dependent, all three matrices are singular, so we get the trivial $0=0$.
Therefore, assume the set of other rows is linearly independent. Then we can add another vector $\vec{\beta}$ to make a basis:
$$\{\vec{\rho}_1,...,\vec{\rho}_{i-1},\vec{\beta},\vec{\rho}_{i+1},...,\vec{\rho}_n\}$$
Then $\vec{v}$ and $\vec{w}$ can be expressed with respect to this basis and added:
$$\vec{v}=v_1\vec{\rho}_1+\cdots+v_{i-1}\vec{\rho}_{i-1}+v_i\vec{\beta}+v_{i+1}\vec{\rho}_{i+1}+\cdots+v_n\vec{\rho}_n$$
$$\vec{w}=w_1\vec{\rho}_1+\cdots+w_{i-1}\vec{\rho}_{i-1}+w_i\vec{\beta}+w_{i+1}\vec{\rho}_{i+1}+\cdots+w_n\vec{\rho}_n$$
$$\vec{v}+\vec{w}=(v_1+w_1)\vec{\rho}_1+\cdots+(v_i+w_i)\vec{\beta}+\cdots+(v_n+w_n)\vec{\rho}_n$$
Now substitute this into the left-hand side of property 1:
$$\text{det}(\vec{\rho}_1,...,(v_1+w_1)\vec{\rho}_1+\cdots+(v_i+w_i)\vec{\beta}+\cdots+(v_n+w_n)\vec{\rho}_n,...,\vec{\rho}_n)$$
From condition (1), the determinant doesn't change if we add $-(v_j+w_j)\vec{\rho}_j$ to row $i$ for each $j\ne i$; doing that and then applying condition (3) gives
$$\begin{array}{rcl}\text{det}(\vec{\rho}_1,...,\vec{v}+\vec{w},...,\vec{\rho}_n)&=&\text{det}(\vec{\rho}_1,...,(v_i+w_i)\vec{\beta},...,\vec{\rho}_n)\\&=&(v_i+w_i)\cdot\text{det}(\vec{\rho}_1,...,\vec{\beta},...,\vec{\rho}_n)\\&=&v_i\cdot\text{det}(\vec{\rho}_1,...,\vec{\beta},...,\vec{\rho}_n)+w_i\cdot\text{det}(\vec{\rho}_1,...,\vec{\beta},...,\vec{\rho}_n)\\&=&\text{det}(\vec{\rho}_1,...,v_i\vec{\beta},...,\vec{\rho}_n)+\text{det}(\vec{\rho}_1,...,w_i\vec{\beta},...,\vec{\rho}_n)\end{array}$$
Now add the $v_j\vec{\rho}_j$'s back into the first term and the $w_j\vec{\rho}_j$'s into the second to recreate $\vec{v}$ and $\vec{w}$, giving the desired expression.

Example 13.3

The determinant of
$\begin{pmatrix}1&2\\3&4\end{pmatrix}$ can be split using multilinearity:
$$\begin{vmatrix}1&2\\3&4\end{vmatrix}=\begin{vmatrix}0&2\\3&4\end{vmatrix}+\begin{vmatrix}1&0\\3&4\end{vmatrix}=\begin{vmatrix}1&0\\3&0\end{vmatrix}+\begin{vmatrix}1&0\\0&4\end{vmatrix}+\begin{vmatrix}0&2\\3&0\end{vmatrix}+\begin{vmatrix}0&2\\0&4\end{vmatrix}$$
The first and last matrices have determinant $0$ because the second row is a scalar multiple of the first row.
$$=4\cdot\begin{vmatrix}1&0\\0&1\end{vmatrix}+2\cdot3\cdot\begin{vmatrix}0&1\\1&0\end{vmatrix}$$
We discuss evaluating the determinants of these matrices below.

Example 13.4

Any determinant can be reduced, by multilinearity, to a sum of determinants in which each row has a single entry from the original matrix.
$$\begin{vmatrix}1&2&3\\4&5&6\\7&8&9\end{vmatrix}=\begin{vmatrix}1&0&0\\4&0&0\\7&0&0\end{vmatrix}+\begin{vmatrix}1&0&0\\4&0&0\\0&8&0\end{vmatrix}+\cdots+\begin{vmatrix}0&0&3\\0&0&6\\0&8&0\end{vmatrix}+\begin{vmatrix}0&0&3\\0&0&6\\0&0&9\end{vmatrix}$$
If any two entries come from the same column, then one row is a multiple of another, so that determinant is zero. This leaves $6$ determinants:
$$\begin{vmatrix}1&2&3\\4&5&6\\7&8&9\end{vmatrix}=\begin{vmatrix}1&0&0\\0&5&0\\0&0&9\end{vmatrix}+\begin{vmatrix}1&0&0\\0&0&6\\0&8&0\end{vmatrix}+\begin{vmatrix}0&2&0\\4&0&0\\0&0&9\end{vmatrix}+\begin{vmatrix}0&2&0\\0&0&6\\7&0&0\end{vmatrix}+\begin{vmatrix}0&0&3\\4&0&0\\0&8&0\end{vmatrix}+\begin{vmatrix}0&0&3\\0&5&0\\7&0&0\end{vmatrix}$$
$$=45\cdot\begin{vmatrix}1&0&0\\0&1&0\\0&0&1\end{vmatrix}+48\cdot\begin{vmatrix}1&0&0\\0&0&1\\0&1&0\end{vmatrix}+72\cdot\begin{vmatrix}0&1&0\\1&0&0\\0&0&1\end{vmatrix}+84\cdot\begin{vmatrix}0&1&0\\0&0&1\\1&0&0\end{vmatrix}+96\cdot\begin{vmatrix}0&0&1\\1&0&0\\0&1&0\end{vmatrix}+105\cdot\begin{vmatrix}0&0&1\\0&1&0\\1&0&0\end{vmatrix}$$
We discuss evaluating the determinants of these matrices below.

Permutation Matrices

A permutation matrix is a matrix where every entry is $0$ except for a single $1$ in each row and column.
Define an $n$-permutation as a bijective function on the first $n$ positive integers, $\phi:\{1,...,n\}\to\{1,...,n\}$.
In other words, each of $1,...,n$ in the output is associated with exactly one input.

Example 13.5

The $3$-permutations are
$$\begin{array}{ccc}\phi_1=\langle1,2,3\rangle&\phi_2=\langle1,3,2\rangle&\phi_3=\langle2,1,3\rangle\\\phi_4=\langle2,3,1\rangle&\phi_5=\langle3,1,2\rangle&\phi_6=\langle3,2,1\rangle\end{array}$$

We denote the row matrix with all $0$s except for a $1$ in entry $j$ by $\iota_j$ (e.g. the four-wide $\iota_2=\begin{pmatrix}0&1&0&0\end{pmatrix}$). With this, the permutation $\phi=\langle\phi(1),...,\phi(n)\rangle$ is associated with the matrix whose rows are $\iota_{\phi(1)},...,\iota_{\phi(n)}$.
For example, the matrix associated with the $4$-permutation $\phi=\langle2,4,3,1\rangle$ is
$$P_\phi=\begin{pmatrix}\iota_2\\\iota_4\\\iota_3\\\iota_1\end{pmatrix}=\begin{pmatrix}0&1&0&0\\0&0&0&1\\0&0&1&0\\1&0&0&0\end{pmatrix}$$
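
A small sketch of this construction (the function name and the 1-based tuple input are my choices):

```python
def permutation_matrix(phi):
    """Stack the rows iota_phi(1), ..., iota_phi(n), where iota_j is
    the row with a single 1 in entry j (phi is 1-based, as in the notes)."""
    n = len(phi)
    return [[1 if col == phi[row] - 1 else 0 for col in range(n)]
            for row in range(n)]

for row in permutation_matrix((2, 4, 3, 1)):
    print(row)
# [0, 1, 0, 0]
# [0, 0, 0, 1]
# [0, 0, 1, 0]
# [1, 0, 0, 0]
```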

Now we can define the permutation expansion for determinants:
$$\begin{vmatrix}t_{1,1}&t_{1,2}&\cdots&t_{1,n}\\t_{2,1}&t_{2,2}&\cdots&t_{2,n}\\\vdots&\vdots&\ddots&\vdots\\t_{n,1}&t_{n,2}&\cdots&t_{n,n}\end{vmatrix}=\begin{array}{l}t_{1,\phi_1(1)}\cdot t_{2,\phi_1(2)}\cdots t_{n,\phi_1(n)}|P_{\phi_1}|\\+\,t_{1,\phi_2(1)}\cdot t_{2,\phi_2(2)}\cdots t_{n,\phi_2(n)}|P_{\phi_2}|\\\quad\vdots\\+\,t_{1,\phi_k(1)}\cdot t_{2,\phi_k(2)}\cdots t_{n,\phi_k(n)}|P_{\phi_k}|\end{array}$$
where $\phi_1,...,\phi_k$ are all of the $n$-permutations.
In summation notation,
$$|T|=\sum_{\text{permutations }\phi}t_{1,\phi(1)}\cdot t_{2,\phi(2)}\cdots t_{n,\phi(n)}|P_{\phi}|$$
the sum over all $n$-permutations $\phi$ of terms of the form $t_{1,\phi(1)}t_{2,\phi(2)}\cdots t_{n,\phi(n)}|P_{\phi}|$.

Example 13.6

Consider a $2\times2$ matrix. There are two $2$-permutations, $\phi_1=\langle1,2\rangle$ and $\phi_2=\langle2,1\rangle$. The associated permutation matrices are
$$\begin{array}{cc}P_{\phi_1}=\begin{pmatrix}1&0\\0&1\end{pmatrix}&P_{\phi_2}=\begin{pmatrix}0&1\\1&0\end{pmatrix}\end{array}$$
So we get the expansion
$$\begin{array}{rcl}\begin{vmatrix}a&b\\c&d\end{vmatrix}&=&ad\cdot\begin{vmatrix}1&0\\0&1\end{vmatrix}+bc\cdot\begin{vmatrix}0&1\\1&0\end{vmatrix}\\&=&ad\cdot(1)+bc\cdot(-1)\\&=&ad-bc\end{array}$$
This recovers the familiar formula for the determinant of a $2\times2$ matrix.
Note that $\begin{vmatrix}0&1\\1&0\end{vmatrix}=-1$ because it is one row swap away from $I$.

Theorem: For each $n$ there is an $n\times n$ determinant function.
Theorem: The determinant of a matrix equals the determinant of its transpose.
This means statements about rows apply to columns too; e.g. if row combinations don't change the determinant, column combinations don't either.
Also, a matrix with two equal columns is singular, swapping columns changes the sign, and determinants are multilinear in their columns.

If $T$ is a lower triangular matrix, the determinant is still the product down the diagonal.
This is true because $T^T$ has the same diagonal and is upper triangular, so its determinant is the product down the diagonal, and $|T|=|T^T|$.


Existence of Determinants

In a permutation $\phi=\langle...,k,...,j,...\rangle$ or its permutation matrix
$$P_\phi=\begin{pmatrix}\vdots\\\iota_k\\\vdots\\\iota_j\\\vdots\end{pmatrix}$$
a pair of elements $k,j$ (or rows $\iota_k,\iota_j$) with $k>j$, where $k$ appears before $j$, is called an inversion.
$\phi=\langle3,2,1\rangle$ has 3 inversions: $3$ before $2$, $2$ before $1$, and $3$ before $1$.

A row swap in a permutation matrix changes the parity of the number of inversions.

Proof

If the rows are adjacent, swapping the two won't affect their inversions with any other row, so the number of inversions changes by exactly $1$.
If they are not adjacent, swap them via a sequence of adjacent swaps, starting by bringing row $k$ up
$$\begin{pmatrix}\vdots\\\iota_{\phi(j)}\\\iota_{\phi(j+1)}\\\vdots\\\iota_{\phi(k)}\\\vdots\end{pmatrix}\xrightarrow{\rho_k\leftrightarrow\rho_{k-1}}\space\xrightarrow{\rho_{k-1}\leftrightarrow\rho_{k-2}}\space\cdots\space\xrightarrow{\rho_{j+1}\leftrightarrow\rho_j}\begin{pmatrix}\vdots\\\iota_{\phi(k)}\\\iota_{\phi(j)}\\\vdots\\\iota_{\phi(k-1)}\\\vdots\end{pmatrix}$$
then moving row $j$ down
$$\xrightarrow{\rho_{j+1}\leftrightarrow\rho_{j+2}}\space\xrightarrow{\rho_{j+2}\leftrightarrow\rho_{j+3}}\space\cdots\space\xrightarrow{\rho_{k-1}\leftrightarrow\rho_k}\begin{pmatrix}\vdots\\\iota_{\phi(k)}\\\iota_{\phi(j+1)}\\\vdots\\\iota_{\phi(j)}\\\vdots\end{pmatrix}$$
The total number of adjacent swaps is $(k-j)+(k-j-1)=2(k-j)-1$, which is odd, so the parity of the number of inversions changes.

The signum of a permutation, $\text{sgn}(\phi)$, is
$$\text{sgn}(\phi)=\begin{cases}-1&\text{if the number of inversions is odd}\\+1&\text{if the number of inversions is even}\end{cases}$$

If $\text{sgn}(\phi)=-1$, it takes an odd number of swaps to bring $P_\phi$ back to the identity, and if $\text{sgn}(\phi)=1$, it takes an even number (this should be fairly intuitive).
$\implies|P_\phi|=\text{sgn}(\phi)$, because each row swap changes the sign and $|I|=1$.
Thus, the permutation expansion becomes
$$d(T)=\sum_{\text{permutations }\phi}t_{1,\phi(1)}\cdot t_{2,\phi(2)}\cdots t_{n,\phi(n)}\cdot\text{sgn}(\phi)$$
The signum function is clearly well-defined: just count the number of inversions.
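
Here is a minimal sketch of $d(T)$ as just defined (names and 0-based indexing are mine), with the signum computed by counting inversions:

```python
from itertools import permutations

def sgn(phi):
    """Signum by inversion count: -1 if odd, +1 if even."""
    n = len(phi)
    inversions = sum(1 for a in range(n) for b in range(a + 1, n)
                     if phi[a] > phi[b])
    return -1 if inversions % 2 else 1

def d(t):
    """The permutation expansion: sum over all permutations phi of
    t[0][phi(0)] * ... * t[n-1][phi(n-1)] * sgn(phi)."""
    n = len(t)
    total = 0
    for phi in permutations(range(n)):
        term = sgn(phi)
        for row in range(n):
            term *= t[row][phi[row]]
        total += term
    return total

print(d([[1, 2], [3, 4]]))                     # 1*4 - 2*3 = -2
print(d([[1, 3, -2], [2, 0, 4], [3, -1, 5]]))  # 14, Example 13.1
```

Note the $n!$ terms: this is fine for checking small cases but far slower than Gauss's Method.
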
So finally, we will show that this d(T)d(T) satisfies the conditions, proving that the determinant exists for all nn.

Proof

Condition (4) is easy: for $I$, every term of the summation is $0$ except the one for the identity permutation, which is the product down the diagonal, $1$.
For condition (3), suppose $T\xrightarrow{k\rho_i}\hat{T}$ and consider $d(\hat{T})$:
$$\sum_{\text{perm }\phi}\hat{t}_{1,\phi(1)}\cdots\hat{t}_{i,\phi(i)}\cdots\hat{t}_{n,\phi(n)}\,\text{sgn}(\phi)=\sum_{\text{perm }\phi}t_{1,\phi(1)}\cdots k\,t_{i,\phi(i)}\cdots t_{n,\phi(n)}\,\text{sgn}(\phi)\\=k\sum_{\text{perm }\phi}t_{1,\phi(1)}\cdots t_{i,\phi(i)}\cdots t_{n,\phi(n)}\,\text{sgn}(\phi)=k\cdot d(T)$$
which is the desired equality.
For condition (2), suppose $T\xrightarrow{\rho_i\leftrightarrow\rho_j}\hat{T}$. We must show $d(\hat{T})=-d(T)$. Since row $i$ of $\hat{T}$ is row $j$ of $T$ and vice versa, each term of
$$\sum_{\text{perm }\phi}\hat{t}_{1,\phi(1)}\cdots\hat{t}_{i,\phi(i)}\cdots\hat{t}_{j,\phi(j)}\cdots\hat{t}_{n,\phi(n)}\,\text{sgn}(\phi)$$
equals the term of $d(T)$ for the permutation obtained from $\phi$ by swapping the values $\phi(i)$ and $\phi(j)$, and as established before, a swap flips the signum. Summing over all permutations,
$$\sum_{\text{perm }\phi}\hat{t}_{1,\phi(1)}\cdots\hat{t}_{i,\phi(i)}\cdots\hat{t}_{j,\phi(j)}\cdots\hat{t}_{n,\phi(n)}\cdot\text{sgn}(\phi)=\sum_{\text{perm }\phi}t_{1,\phi(1)}\cdots t_{i,\phi(i)}\cdots t_{j,\phi(j)}\cdots t_{n,\phi(n)}\cdot(-\text{sgn}(\phi))=-d(T)$$
For condition (1), suppose $T\xrightarrow{k\rho_i+\rho_j}\hat{T}$. Then
$$d(\hat{T})=\sum_{\text{perm }\phi}\hat{t}_{1,\phi(1)}\cdots\hat{t}_{i,\phi(i)}\cdots\hat{t}_{j,\phi(j)}\cdots\hat{t}_{n,\phi(n)}\,\text{sgn}(\phi)\\=\sum_{\text{perm }\phi}t_{1,\phi(1)}\cdots t_{i,\phi(i)}\cdots(k\,t_{i,\phi(j)}+t_{j,\phi(j)})\cdots t_{n,\phi(n)}\,\text{sgn}(\phi)$$
Distributing over the addition and breaking into two summations:
$$=k\cdot\sum_{\text{perm }\phi}t_{1,\phi(1)}\cdots t_{i,\phi(i)}\cdots t_{i,\phi(j)}\cdots t_{n,\phi(n)}\,\text{sgn}(\phi)\\+\sum_{\text{perm }\phi}t_{1,\phi(1)}\cdots t_{i,\phi(i)}\cdots t_{j,\phi(j)}\cdots t_{n,\phi(n)}\,\text{sgn}(\phi)$$
See that the second term is $d(T)$.
In the first term, the entry is $t_{i,\phi(j)}$, not $t_{j,\phi(j)}$: that sum is the determinant of a matrix $S$ equal to $T$ except that row $j$ of $S$ is row $i$ of $T$, giving $S$ two copies of row $i$. Thus the first term is $k\cdot0=0$, making $d(\hat{T})=d(T)$ as desired.

Thus, for every $n$, there exists a determinant function $\mathcal{M}_{n\times n}\to\mathbb{R}$.
Finally, we can show $|T|=|T^T|$ using the expansion.
$$|T|=\sum_{\text{perm }\phi}t_{1,\phi(1)}\cdot t_{2,\phi(2)}\cdots t_{n,\phi(n)}\,\text{sgn}(\phi)$$
The products that appear for $T^T$ are the same, since $|T|$ sums over all ways to take one entry from each row and column of $T$, and $|T^T|$ sums over all ways to take one entry from each column and row of $T$. The only possible difference is in $\text{sgn}(\phi)$: the product that $\phi$ contributes to $|T|$ appears in $|T^T|$ with $\text{sgn}(\phi^{-1})$, but $\text{sgn}(\phi)=\text{sgn}(\phi^{-1})$, so the two sums agree.


Determinants as Size Functions

A box or parallelepiped in $\mathbb{R}^n$ formed by $\langle\vec{v}_1,...,\vec{v}_n\rangle$ is the set $\{t_1\vec{v}_1+\cdots+t_n\vec{v}_n\mid t_1,...,t_n\in[0,1]\}$

A parallelepiped in $\mathbb{R}^2$

The determinant of the $2\times 2$ matrix,
$$\begin{vmatrix}x_1&x_2\\y_1&y_2\end{vmatrix}$$
represents the area of the parallelepiped in $\mathbb{R}^2$ formed by the columns $\begin{pmatrix}x_1\\y_1\end{pmatrix}$ and $\begin{pmatrix}x_2\\y_2\end{pmatrix}$

Geometric Interpretations

Recall that the transpose does not change the determinant, so column operations are valid operations (just transposed row operations).
Also recall that scaling a column (or equivalently, a row) by $k$ scales the whole determinant by $k$.
This makes sense, as it is analogous to scaling a side length of the box.

As for the condition stating that row combinations (or the equivalent column combinations) do not change the determinant: a combination slants the box along one side.

The base is the same and the slant is different, but the height is the same, so the area remains the same.

Also, it is clear that the identity matrix has determinant $1$: a box made from $\begin{pmatrix}1\\0\end{pmatrix}$ and $\begin{pmatrix}0\\1\end{pmatrix}$ has area $1$.

Swapping the vectors should negate the area. But area is positive, so instead the sign of the determinant reflects the orientation or sense of the box.
This gives the right-hand rule in $\mathbb{R}^3$: do a "thumbs up" with your right hand and place it on the spanning plane so that your fingers curl from $\vec{v}_1$ to $\vec{v}_2$. Vectors on the side of the thumb define positive-sized boxes.

The determinant of the product of two matrices is the product of the determinants:
$$|TS|=|T||S|$$

Proof

First, suppose $T$ is singular, with no inverse. If $TS$ were invertible, there would exist some $M$ such that $(TS)M=T(SM)=I$, making $T$ invertible. The contrapositive says that if $T$ is not invertible then neither is $TS$, so $|T||S|=|TS|=0$.
If $T$ is invertible, then it is a product of elementary matrices, $T=E_1E_2\cdots E_r$. Showing $|ES|=|E||S|$ for all matrices $S$ and elementary matrices $E$ proves the result.
For $M_i(k)$, the matrix that multiplies row $i$ by $k$, we have $|M_i(k)|=k|I|=k$ from condition three, and also $|M_i(k)S|=k|S|$ from condition three, so $|M_i(k)S|=|M_i(k)||S|$. The cases for the other two types of elementary matrices are similar.
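
As a quick numerical check (the matrices here are mine):
$$T=\begin{pmatrix}2&0\\0&3\end{pmatrix},\quad S=\begin{pmatrix}1&2\\3&4\end{pmatrix},\quad TS=\begin{pmatrix}2&4\\9&12\end{pmatrix}$$
$$|TS|=2\cdot12-4\cdot9=-12=6\cdot(-2)=|T||S|$$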

From above, we can derive the determinant of the inverse:
$$1=|I|=|TT^{-1}|=|T||T^{-1}|\implies|T^{-1}|=\frac{1}{|T|}$$

The volume of a box is the absolute value of the determinant of a matrix with those vectors as columns.
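
For example (vectors mine), the box formed by $\begin{pmatrix}2\\1\end{pmatrix}$ and $\begin{pmatrix}1\\3\end{pmatrix}$ has volume (area)
$$\left|\,\begin{vmatrix}2&1\\1&3\end{vmatrix}\,\right|=|6-1|=5$$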


Cramer's Rule

Recall that a linear system is equivalent to a linear vector equation.
$$\begin{array}{ccc}\begin{array}{c}x_1+2x_2=6\\3x_1+x_2=8\end{array}&\iff&x_1\begin{pmatrix}1\\3\end{pmatrix}+x_2\begin{pmatrix}2\\1\end{pmatrix}=\begin{pmatrix}6\\8\end{pmatrix}\end{array}$$
The geometric interpretation: by what factors $x_1$ and $x_2$ must we scale the sides of the parallelogram so that the scaled sides sum to the right-hand vector?

Consider expanding only one side of the parallelogram and compare the sizes: scaling one side by $x_1$ scales the area by $x_1$.

Together, we have
$$x_1\begin{vmatrix}1&2\\3&1\end{vmatrix}=\begin{vmatrix}x_1\cdot1&2\\x_1\cdot3&1\end{vmatrix}=\begin{vmatrix}x_1\cdot1+x_2\cdot2&2\\x_1\cdot3+x_2\cdot1&1\end{vmatrix}=\begin{vmatrix}6&2\\8&1\end{vmatrix}$$
So dividing both sides,
$$x_1=\frac{\begin{vmatrix}6&2\\8&1\end{vmatrix}}{\begin{vmatrix}1&2\\3&1\end{vmatrix}}=\frac{-10}{-5}=2$$

This gives a new way to solve systems of equations.

Cramer's Rule
Let $A$ be an $n\times n$ matrix with nonzero determinant, let $\vec{b}$ be an $n$-tall column vector, and consider the linear system $A\vec{x}=\vec{b}$. For each $i\in\{1,...,n\}$ let $B_i$ be the matrix obtained by substituting $\vec{b}$ for column $i$ of $A$. Then the value of the $i$-th unknown is $x_i=|B_i|/|A|$.
If the matrix has a zero determinant then the rule does not apply; the system then has either no solution or infinitely many.

Example 13.7

Solve the following system of equations
$$\begin{array}{rcrcrcl}2x_1&+&x_2&-&x_3&=&4\\x_1&+&3x_2&&&=&2\\&&x_2&-&5x_3&=&0\end{array}$$


The corresponding matrix equation is
$$\begin{pmatrix}2&1&-1\\1&3&0\\0&1&-5\end{pmatrix}\begin{pmatrix}x_1\\x_2\\x_3\end{pmatrix}=\begin{pmatrix}4\\2\\0\end{pmatrix}$$
We can find
$$\begin{array}{cc}|A|=\begin{vmatrix}2&1&-1\\1&3&0\\0&1&-5\end{vmatrix}=-26&|B_1|=\begin{vmatrix}4&1&-1\\2&3&0\\0&1&-5\end{vmatrix}=-52\end{array}$$
$$\begin{array}{cc}|B_2|=\begin{vmatrix}2&4&-1\\1&2&0\\0&0&-5\end{vmatrix}=0&|B_3|=\begin{vmatrix}2&1&4\\1&3&2\\0&1&0\end{vmatrix}=0\end{array}$$
So the solutions are
$$\begin{pmatrix}x_1\\x_2\\x_3\end{pmatrix}=\begin{pmatrix}|B_1|\,/\,|A|\\|B_2|\,/\,|A|\\|B_3|\,/\,|A|\end{pmatrix}=\begin{pmatrix}2\\0\\0\end{pmatrix}$$

Note that because this method requires computing several determinants, it is generally much slower to use Cramer's Rule than Gauss's Method for large matrices.
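
A minimal sketch of the rule in code (names are mine; the `det` helper reuses the permutation expansion, so this is only sensible for small $n$):

```python
from fractions import Fraction
from itertools import permutations

def det(t):
    """Permutation-expansion determinant (fine for small n)."""
    n = len(t)
    total = Fraction(0)
    for phi in permutations(range(n)):
        inversions = sum(1 for a in range(n) for b in range(a + 1, n)
                         if phi[a] > phi[b])
        term = Fraction(-1 if inversions % 2 else 1)
        for row in range(n):
            term *= t[row][phi[row]]
        total += term
    return total

def cramer(a, b):
    """Solve A x = b via x_i = |B_i| / |A|, where B_i is A with
    column i replaced by b. Requires |A| != 0."""
    d = det(a)
    if d == 0:
        raise ValueError("zero determinant: Cramer's Rule does not apply")
    n = len(a)
    return [det([[b[r] if c == i else a[r][c] for c in range(n)]
                 for r in range(n)]) / d
            for i in range(n)]

x = cramer([[2, 1, -1], [1, 3, 0], [0, 1, -5]], [4, 2, 0])
print([str(v) for v in x])  # ['2', '0', '0'], matching Example 13.7
```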


Laplace's Formula

Consider the permutation expansion
$$\begin{array}{rl}\begin{vmatrix}t_{1,1}&t_{1,2}&t_{1,3}\\t_{2,1}&t_{2,2}&t_{2,3}\\t_{3,1}&t_{3,2}&t_{3,3}\end{vmatrix}=&t_{1,1}t_{2,2}t_{3,3}\begin{vmatrix}1&0&0\\0&1&0\\0&0&1\end{vmatrix}+t_{1,1}t_{2,3}t_{3,2}\begin{vmatrix}1&0&0\\0&0&1\\0&1&0\end{vmatrix}\\&+t_{1,2}t_{2,1}t_{3,3}\begin{vmatrix}0&1&0\\1&0&0\\0&0&1\end{vmatrix}+t_{1,2}t_{2,3}t_{3,1}\begin{vmatrix}0&1&0\\0&0&1\\1&0&0\end{vmatrix}\\&+t_{1,3}t_{2,1}t_{3,2}\begin{vmatrix}0&0&1\\1&0&0\\0&1&0\end{vmatrix}+t_{1,3}t_{2,2}t_{3,1}\begin{vmatrix}0&0&1\\0&1&0\\1&0&0\end{vmatrix}\end{array}$$
Pick a row or column and factor out. Suppose we choose the first row
$$\begin{array}{rl}\begin{vmatrix}t_{1,1}&t_{1,2}&t_{1,3}\\t_{2,1}&t_{2,2}&t_{2,3}\\t_{3,1}&t_{3,2}&t_{3,3}\end{vmatrix}=&t_{1,1}\left[t_{2,2}t_{3,3}\begin{vmatrix}1&0&0\\0&1&0\\0&0&1\end{vmatrix}+t_{2,3}t_{3,2}\begin{vmatrix}1&0&0\\0&0&1\\0&1&0\end{vmatrix}\right]\\&+t_{1,2}\left[t_{2,1}t_{3,3}\begin{vmatrix}0&1&0\\1&0&0\\0&0&1\end{vmatrix}+t_{2,3}t_{3,1}\begin{vmatrix}0&1&0\\0&0&1\\1&0&0\end{vmatrix}\right]\\&+t_{1,3}\left[t_{2,1}t_{3,2}\begin{vmatrix}0&0&1\\1&0&0\\0&1&0\end{vmatrix}+t_{2,2}t_{3,1}\begin{vmatrix}0&0&1\\0&1&0\\1&0&0\end{vmatrix}\right]\end{array}$$
Using the property that a row swap changes the sign, swap the rows of each permutation matrix so that they match those in the first group. This takes one swap for the $t_{1,2}$ group and two swaps for the $t_{1,3}$ group.
$$\begin{array}{rl}\begin{vmatrix}t_{1,1}&t_{1,2}&t_{1,3}\\t_{2,1}&t_{2,2}&t_{2,3}\\t_{3,1}&t_{3,2}&t_{3,3}\end{vmatrix}=&t_{1,1}\left[t_{2,2}t_{3,3}\begin{vmatrix}1&0&0\\0&1&0\\0&0&1\end{vmatrix}+t_{2,3}t_{3,2}\begin{vmatrix}1&0&0\\0&0&1\\0&1&0\end{vmatrix}\right]\\&-t_{1,2}\left[t_{2,1}t_{3,3}\begin{vmatrix}1&0&0\\0&1&0\\0&0&1\end{vmatrix}+t_{2,3}t_{3,1}\begin{vmatrix}1&0&0\\0&0&1\\0&1&0\end{vmatrix}\right]\\&+t_{1,3}\left[t_{2,1}t_{3,2}\begin{vmatrix}1&0&0\\0&1&0\\0&0&1\end{vmatrix}+t_{2,2}t_{3,1}\begin{vmatrix}1&0&0\\0&0&1\\0&1&0\end{vmatrix}\right]\end{array}$$
The terms in the square brackets simplify to $2\times2$ determinants:
$$=t_{1,1}\begin{vmatrix}t_{2,2}&t_{2,3}\\t_{3,2}&t_{3,3}\end{vmatrix}-t_{1,2}\begin{vmatrix}t_{2,1}&t_{2,3}\\t_{3,1}&t_{3,3}\end{vmatrix}+t_{1,3}\begin{vmatrix}t_{2,1}&t_{2,2}\\t_{3,1}&t_{3,2}\end{vmatrix}$$

The $i,j$ minor of an $n\times n$ matrix $T$ is the $(n-1)\times(n-1)$ matrix formed by deleting row $i$ and column $j$ of $T$. The $i,j$ cofactor $T_{i,j}$ of $T$ is $(-1)^{i+j}$ times the determinant of the $i,j$ minor of $T$.

Example 13.8

For the matrix
$$S=\begin{pmatrix}3&1&2\\5&4&-1\\7&0&-3\end{pmatrix}$$
the $2,3$ minor is
$$\begin{pmatrix}3&1\\7&0\end{pmatrix}$$
and the cofactor is
$$S_{2,3}=(-1)^{2+3}\begin{vmatrix}3&1\\7&0\end{vmatrix}=7$$

Laplace's formula finds the determinant of an $n\times n$ matrix by expanding by cofactors along any row $i$ or column $j$:
$$|T|=t_{i,1}\cdot T_{i,1}+t_{i,2}\cdot T_{i,2}+\cdots+t_{i,n}\cdot T_{i,n}\\=t_{1,j}\cdot T_{1,j}+t_{2,j}\cdot T_{2,j}+\cdots+t_{n,j}\cdot T_{n,j}$$
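
A minimal recursive sketch of this formula, expanding along the first row (names and 0-based indices are mine):

```python
def minor(t, i, j):
    """The i,j minor: delete row i and column j."""
    return [row[:j] + row[j + 1:] for r, row in enumerate(t) if r != i]

def det_laplace(t):
    """Cofactor expansion along the first row:
    |T| = sum_j t[0][j] * (-1)^j * |minor(T, 0, j)|."""
    if len(t) == 1:
        return t[0][0]
    return sum((-1) ** j * t[0][j] * det_laplace(minor(t, 0, j))
               for j in range(len(t)))

print(det_laplace([[3, 1, 2], [5, 4, -1], [7, 0, -3]]))  # -84, Example 13.9
```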

Example 13.9

Find the determinant
$$\begin{vmatrix}3&1&2\\5&4&-1\\7&0&-3\end{vmatrix}$$
by expanding along the second row.
$$\begin{vmatrix}3&1&2\\5&4&-1\\7&0&-3\end{vmatrix}=-5\begin{vmatrix}1&2\\0&-3\end{vmatrix}+4\begin{vmatrix}3&2\\7&-3\end{vmatrix}-(-1)\begin{vmatrix}3&1\\7&0\end{vmatrix}\\=-5(-3)+4(-23)+1(-7)=15-92-7=-84$$

The matrix adjoint (also called the classical adjoint or adjugate) of a square matrix $T$ is
$$\text{adj}(T)=\begin{pmatrix}T_{1,1}&T_{2,1}&\cdots&T_{n,1}\\T_{1,2}&T_{2,2}&\cdots&T_{n,2}\\\vdots&\vdots&&\vdots\\T_{1,n}&T_{2,n}&\cdots&T_{n,n}\end{pmatrix}$$
Note that the row $i$, column $j$ entry is $T_{j,i}$, the $j,i$ cofactor.

Example 13.10

For the same matrix
$$S=\begin{pmatrix}3&1&2\\5&4&-1\\7&0&-3\end{pmatrix}$$
the matrix adjoint is
$$\text{adj}(S)=\begin{pmatrix}-12&3&-9\\8&-23&13\\-28&7&7\end{pmatrix}$$

For a square matrix $T$, $T\cdot\text{adj}(T)=\text{adj}(T)\cdot T=|T|\cdot I$. In other words,
$$\begin{pmatrix}t_{1,1}&\cdots&t_{1,n}\\\vdots&&\vdots\\t_{n,1}&\cdots&t_{n,n}\end{pmatrix}\begin{pmatrix}T_{1,1}&\cdots&T_{n,1}\\\vdots&&\vdots\\T_{1,n}&\cdots&T_{n,n}\end{pmatrix}=\begin{pmatrix}|T|&0&\cdots&0\\0&|T|&\cdots&0\\&&\ddots&\\0&0&\cdots&|T|\end{pmatrix}$$

Proof

Laplace's formula directly shows that the diagonal entries are $|T|$.
For any off-diagonal entry, the multiplication gives
$$t_{i,1}\cdot T_{k,1}+t_{i,2}\cdot T_{k,2}+\cdots+t_{i,n}\cdot T_{k,n}=0$$
because it is the expansion along row $k$ of a matrix whose row $k$ equals its row $i$, and a matrix with two identical rows has determinant $0$.

A result of this is that
$$T^{-1}=\frac{1}{|T|}\text{adj}(T)$$
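
To close, a sketch tying the adjugate to the inverse formula (names are mine; the cofactor-expansion `det` is reused inline):

```python
from fractions import Fraction

def det(t):
    """Cofactor expansion along the first row."""
    if len(t) == 1:
        return t[0][0]
    return sum((-1) ** j * t[0][j]
               * det([row[:j] + row[j + 1:] for row in t[1:]])
               for j in range(len(t)))

def adjugate(t):
    """Row i, column j entry is the j,i cofactor of T."""
    n = len(t)
    return [[(-1) ** (i + j)
             * det([row[:i] + row[i + 1:]               # delete column i
                    for r, row in enumerate(t) if r != j])  # delete row j
             for j in range(n)]
            for i in range(n)]

def inverse(t):
    """T^{-1} = adj(T) / |T|, in exact fractions."""
    d = Fraction(det(t))
    return [[Fraction(x) / d for x in row] for row in adjugate(t)]

S = [[3, 1, 2], [5, 4, -1], [7, 0, -3]]
print(adjugate(S))  # [[-12, 3, -9], [8, -23, 13], [-28, 7, 7]], Example 13.10
# inverse(S) is adj(S) divided by |S| = -84.
```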